Regression of a data matrix on descriptors of both its rows and of its columns via latent variables: L-PLSR
نویسندگان
چکیده
A new approach is described, for extracting and visualising structures in a data matrix Y in light of additional information BOTH about the ROWS in Y, given in matrix X, AND about the COLUMNS in Y, given in matrix Z. The three matrices Z–Y–X may be envisioned as an “L-shape”; X(I × K) and Z(J × L) share no matrix size dimension, but are connected via Y(I × J ). A few linear combinations (components) are extracted from X and from Z, and their interactions are used for bi-linear modelling of Y, as well as for bi-linear modelling of X and Z themselves. The components are de?ned by singular value decomposition (SVD) of X′YZ. Two versions of the L-PLSR are described—using one single SVD for all components, or component-wise SVDs after deBation. The method is applied to the analysis of consumer liking data Y of six products assessed by 125 persons, in light of 10 other product descriptors X and 15 other person descriptors Z. Its performance is also checked on arti?cial data. c © 2003 Elsevier B.V. All rights reserved. ∗ Corresponding author. Matforsk, The Norwegian Food Research Institute, Oslov. 1, N-1432 Aas, Norway. Tel.: +47-64970100; fax: +47-6470333. E-mail address: [email protected] (H. Martens). 0167-9473/$ see front matter c © 2003 Elsevier B.V. All rights reserved. doi:10.1016/j.csda.2003.10.004 104 H. Martens et al. / Computational Statistics & Data Analysis 48 (2005) 103–123
منابع مشابه
Graph Matrix Completion in Presence of Outliers
Matrix completion problem has gathered a lot of attention in recent years. In the matrix completion problem, the goal is to recover a low-rank matrix from a subset of its entries. The graph matrix completion was introduced based on the fact that the relation between rows (or columns) of a matrix can be modeled as a graph structure. The graph matrix completion problem is formulated by adding the...
متن کاملInferring Latent Structure From Mixed Real and Categorical Relational Data
We consider analysis of relational data (a matrix), in which the rows correspond to subjects (e.g., people) and the columns correspond to attributes. The elements of the matrix may be a mix of real and categorical. Each subject and attribute is characterized by a latent binary feature vector, and an inferred matrix maps each row-column pair of binary feature vectors to an observed matrix elemen...
متن کاملSynoptic Analysis of Early Heat Waves in Northwest of Iran
Early heat waves are extreme events that cause heavy losses in plant and animal life and cause many social and economic problems for communities. The purpose of this study was to identify synoptic patterns and statistical analysis of preterm heat waves in northwestern Iran. To do this, the maximum daily temperature data of March 14th was used for fourteen synoptic stations in the northwest of t...
متن کاملProbabilistic Matrix Addition
We introduce Probabilistic Matrix Addition (PMA) for modeling real-valued data matrices by simultaneously capturing covariance structure among rows and among columns. PMA additively combines two latent matrices drawn from two Gaussian Processes respectively over rows and columns. The resulting joint distribution over the observed matrix does not factorize over entries, rows, or columns, and can...
متن کاملA Performance Assessment of Model Selection Criteria When the Number of Objects Is Much Larger than the Number of Variables in PLSR
Partial Least Squares Regression (PLSR) is a method for constructing predictive models when the variables are many and highly collinear. Its goal is to predict a set of response variables from a set of predictor variables. This prediction is achieved by extracting a set of orthogonal factors called latent variables from the predictor variables. This study investigated the performances of model ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Computational Statistics & Data Analysis
دوره 48 شماره
صفحات -
تاریخ انتشار 2005